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Abstract 

A supermassive black hole ejected from the center of a galaxy by gravitational wave recoil carries a retinue 
of bound stars - a "hypercompact stellar system" (HCSS). The numbers and properties of HCSSs contain 
information about the merger histories of galaxies, the late evolution of binary black holes, and the distribution 
of gravitational-wave kicks. We relate the structural properties (size, mass, density profile) of HCSSs to the 
properties of their host galaxies and to the size of the kick, in two regimes: collisional (Mbh lO^M©), i.e. 
short nuclear relaxation times; and collisionless (Mbh lO^Mp)), i.e. long nuclear relaxtion times. HCSSs are 
expected to be similar in size and luminosity to globular clusters but in extreme cases (large galaxies, kicks 
just above escape velocity) their stellar mass can approach that of ultra-compact dwarf galaxies. However they 
differ from all other classes of compact stellar system in having very high internal velocities. We show that 
the kick velocity is encoded in the velocity dispersion of the bound stars. Given a large enough sample of 
HCSSs, the distribution of gravitational-wave kicks can therefore be empirically determined. We combine a 
hierarchical merger algorithm with stellar population models to compute the rate of production of HCSSs over 
time and the probability of observing HCSSs in the local universe as a function of their apparent magnitude, 
color, size and velocity dispersion, under two different assumptions about the star formation history prior to 
the kick. We predict that ~ 10^ HCSSs should be detectable within 2 Mpc of the center of the Virgo cluster 
and that many of these should be bright enough that their kick velocities (i.e. velocity dispersions) could be 
measured with reasonable exposure times. We discuss other strategies for detecting HCSSs and speculate on 
some exotic manifestations. 
Subject headings: 



1. INTRODUCTION 

A natural place to search for supermassive black holes 
(SMBHs) is at the centers of galaxies, where they pre- 
sumably are born and spend most of their lives. But 
it has become increasingly clear that a SMBH can 
be violently separated from its birthplace as a result 
of linear momentum imparted by gravitational waves 
during stro n g-field int eractions with other SMBH s 
dPeres 19621: iBekenstein 1973t iRedmount & Rees 19891) . 
The largest net recoils are produced from configurations 
that bring the two holes close enough together to coa- 
lesce. Kick velocities following coalescence can be as 
high as ^ 200 km s~ ' in the case of non spinning holes 
(iGonzalez et al. 2007aL ISopuerta et al. 2007h : ~ 4000 km 
s^' for maxi mally spin ning, equal rnass BHs on initially cir- 
cular orbits dCa mpanelli et al. 2007; Gonzalez et al. 2007b; 
Herrmann et al. 2007: Pollnev 2007; Tichv and Marronetti : 
BrUgmann e t al. 2008; Dain et al. 2008; Ba ker et al. 2008h : 
and even higher, ^ 10,000 km s ~\ for b lack holes that 
approach on nearly-unbound orbits (iHealy et al. 2008). Since 
escape velocities fror n the centers of even the largest galaxies 
are <, 2000 km s"^ jMerritt et al. 2004) . it follows that the 
kicks can in principle remove SMBHs completely from their 
host gala xies. W hile such extreme events may be relatively 
rare (e.g. ISchni ttman & Buonanno 2007; Schnittman 2007), 
recoils large enough to displace SMBHs at least tem- 



porarily from galaxy cores - to distances of several 
hundred to a few thousand parsecs - may be much more 
common ( Merritt et al. 2004; Madau and Ouataert 2004 
lGualandris"& Merritt 2008; .Komossa & Merritt 2008b,) . 

Komossa et al. (2008) reported the detection of a re- 
coil candidate. This quasar exhibits a kinematically off- 
set broad-line region with a velocity of 2650 km s^' , 
and very narrow, restframe, high-excitation emission lines 
which lack the usual ionization stratification - two key 
signature s of kicks. In a d dition to spectrosco pic sig- 
natures (iMerritt et al. 2006bt iBonning et al. 2007h . recoil- 
ing SMBHs could be dete cted by the i r soft X-ray, UV 
and IR flaring dS hTelds & Bon ning 20081 iLippai et al. 20081: 
ISchnittman and Krolik 2008.) resulting from shocks in the ac- 
cretion disk surrounding the coalesced SMBH. Detection of 
recoiling SMBHs in this way is contingent on the presence 
of gas. But only a small fraction of nuclear SMBHs ex- 
hibit signatures associated with gas accretion, and a SMBH 
that has been displaced from the center of its galaxy will 
only shine a s a quasar until its bound gas has been used up 
dLoeb 20071) . The prospect that the SMBH will encounter and 
capture signifi cant amounts of gas on its way out are small 
jKapoor 19761) . 

A SMBH ejected from the center of a galaxy will 
always carry with it a retinue of bound stars. The 
stars can reveal themselves via tidal disruption flares or 
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via accretion of gas from stellar winds onto the SMBH 
(iKomossa & Merritt 2008aL hereafter Paper 1). The cluster of 
stars is itself directly observable, and that is what we discuss 
in the current work. The linear extent of such a cluster is fixed 
by the magnitude Vk of the kick velocity and by the mass of 
the SMBH: 



rk = GMBH/Vk 



^ 0.043 pc 



V IO^Mq J V 103 km s 
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Reasonable assumptions about the density of stars around 
the binary SMBH prior to the kick (Paper 1) then imply a total 
luminosity of the bound population comparable to that of a 
globular star cluster. 

In this paper we discuss the properties of these "hyper- 
compact stellar systems" (HCSSs) and their relation to host 
galaxy properties. Our emphasis is on the prospects for de- 
tecting such objects in the nearby universe at optical wave- 
lengths, and so we focus on the properties that would distin- 
guish HCSSs from other stellar systems of comparable size 
or luminosity. As noted in Paper 1, a key signature is their 
high internal velocity dispersion: because the gravitational 
force that binds the cluster comes predominantly from the 
SMBH, of mass 10*' < Mbh/Mq < 10^ stellar velocities will 
be much higher than in ordinary stellar systems of compara- 
ble luminosity. Other signatures include the small sizes of 
HCSSs (unfortunately, too small to be resolved except for 
the most nearby objects); their high space velocities (due to 
the kick); and their broad-band colors, which should resem- 
ble more closely the colors of galactic nuclei rather than the 
colors of uniformly old and metal-poor systems like globular 
clusters. 

As we discuss in more detail below (§2), a remarkable prop- 
erty of HCSSs is that they encode, via their internal kinemat- 
ics, the velocity of the kick that removed them from their host 
galaxy. A measurement of the velocity dispersion of the stars 
in a HCSS is tantamount to a measurement of the amplitude 
of the kick - independent of how long ago the kick occurred; 
the black hole mass; and the space velocity of the HCSS at 
the moment of observation. This property of HCSSs opens 
the door to an empirical determination of the distribution of 
gravitational- wave kicks. 

The outline of the paper is as follows. §2 derives the re- 
lations between the structural parameters of HCSSs- mass, 
radius, and internal velocity dispersion - given assumed val- 
ues for the slope and density normalization of the stellar 
population around the SMBH just before the kick. In §3, 
models for the evolution of binary SMBHs are reviewed and 
their implications for the pre-kick distribution of stars are de- 
scribed. These results, combined with the relations derived in 
§2, allow us to relate the structural parameters of HCSSs to 
the global properties of the galaxies from which they were 
ejected. §4 discusses the effect of post-kick dynamical evo- 
lution of the HCSSs on their observable properties. Stellar 
evolutionary models are used to predict the luminosities and 
colors of HCSSs and their post-kick evolution in §5, and in 
§6, the evolutionary models are combined with models of hi- 
erarchical merging to estimate the number of HCSSs to be 
expected per unit volume in the local universe as a function 
of their observable properties. §7 discusses search strategies 
for HCSSs and various other observable signatures that might 
be uniquely associated with them. In §8 we briefly discuss 
the inverse problem of reconstructing the distribution of re- 



coil velocities from an observed sample of HCSSs. §9 sums 
up and suggests topics for further investigation. 

2. STRUCTURAL RELATIONS 

In what follows, we adopt the Mbh — relation in the form 
given by Ferrarese & Ford (2005): 
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with o the 1-D velocity dispersion of the galaxy bulge. The 
influence radius of the SMBH is defined as 
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and rk w (o/Vk)^nnfl- 



2.1. Bound Population 

As discussed in Paper 1, a recoiling SMBH carries with it a 
cloud of stars on bound orbits. Just prior to the kick, most of 
the stars that will remain bound lie within a sphere of radius 
- rk around the SMBH (eq. [B- Setting Mbh 3 x IO^Mq 
and Vk = 4000 km s^' gives rk ~ 10^^ pc as an approximate, 
minimum expected value for the size of a HCSS; such a small 
size justifies the adjective "hypercompact". The largest values 
of rk would probably be associated with HCSSs ejected from 
the most massive galaxies, containing SMBHs with masses 
Mbh ~ 3 X lO^M© and travelling with a velocity just above 
escape, ^ 2000 km s^ ' ; this implies rk « several pc - similar 
to a large globular cluster. 

Assuming a power law density profile before the kick, 
p(r) = p(ro)(r/ro)^'', the stellar mass Mk initially within ra- 
dius rk is 

47r 

Mk = M(r<rk) = - p(rk)r3 (4a) 
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where po = p(ro). As a fiducial radius at which to normalize 
the pre-kick density profile, we take ro = r,, defined as the 
radius containing an integrated mass in stars equal to twice 
Mbh- (We expect r, to be of order ring; see §3 for a further 
discussion.) Equation (|4| then becomes 

3-Y 

(5) 



Mk = 2Mbh I — — 



V r.Vi J 

After the kick, the density profile will be nearly unchanged 
at r < rk but will be strongly truncated at larger radii. We 
define Mb to be the total mass in stars that remain bound to 
the SMBH after the kick, and write 



f - pMfGMBHV ^ 



-2(3-Y) 



where 



2Mb=Fi(Y)Mk. 



(6a) 
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(7) 



Kicks large enough to remove a SMBH from a galaxy 
core must exceed a, and escape from the galaxy implies 
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Fig. 1. — Dimensionless factors that describe (a) the stell ar mass bound 
to a kicked SMBH (eq.|6) and (b) its effective radius (eq.[l7)- Thick (black) 
line in the upper panel is the exact expression derived in Appendix A and 
thin (black) line is the approximation, given in eq.[8] Open circles in (b) were 
computed using a Monte-Carlo algorithm. Dash ed (b lue) lines in both panels 
show the Dehnen-model approximations of eqs. (15) and il8\ . 



(Vk/o)^ ^ 1; hence ^ nnfi to a good approximation. It fol- 
lows that stars that remain bound following the kick will be 
moving essentially in the point-mass potential of the SMBH 
both before and after the kick. To the same order of approxi- 
mation, the SMBH's velocity is almost unchanged as it climbs 
out of the galaxy potential well (at least during the relatively 
short time required for the stars to reach a new steady state 
distribution after the kick). Finally, since the bulk of the 
recoil is imparted to the SMBH in a time ~ GMbh/c^, the 
kick is essentially instantaneous as seen by stars at distances 
r ^ GMbh/c^ < (Schnittman et al. 2008). 

These three approximations allow the properties of the 
bound population to be computed uniquely given the initial 
distribution (Paper I). Transferring to a frame moving with 
velocity Vk after the kick, the stars respond as if they had re- 
ceived an implusive velocity change — Vk at the instant of the 
kick, causing the elements of their Keplerian orbits about the 
SMBH to instantaneously change. As a result, all initially- 
bound stars outside of the sphere r — 8rk at the moment of 
the kick acquire positive energies with respect to the SMBH 
and escape. Some of the stars initially at rk ^ r < Shc escape 
while others remain bound. The stellar distribution at r rk 
is almost unchanged. 

Appendix A presents a computation of the bound mass un- 
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Fig. 2. — Evolution of the bound population following a kick; the kick was 
in the —X direction at ? = 0. Each frame is centered on the (moving) SMBH. 
Stars were initially distributed as a power law in density, p « r^^/"*; only stars 
which remain bound following the kick are plotted. Unit of length is and 
frames (a,b,c,d) correspond to times of (0, 10.30, 100) in units of GMqu/V^. 

der these approximations and gives an expression for Fi (y) m 



terms of integrals of simple functions. Figure [1^ plots this 
expression, and also the function 

Fi(y) = 11.67-'", (8) 

which is seen to be an excellent approximation for 
0.7 < Y< 2.5. (As a check, we computed Fi(y) in another 
way: we generated Monte-Carlo samples of positions and ve- 
locities corresponding to an isotropic, power-law distribution 
of stars around the SMBH prior to the kick and discarded the 
stars that would be unbound after the kick.) Combining equa- 
tions ^ and ([8]), we get 
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Setting y = 1 in this expression gives 
/b « 2 X 10 
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(10) 



which reproduces reasonably well the values for the bound 
mass found by Boylan-Kolchin et al. (2004) in their A^-body 
simulations of kicked SMBHs; their galaxy models had cen- 
tral power-law density cusps with y = 1 . 

Setting y== 1.75, the value corresponding to a coUisional 
(Bahcall-Wolf) cusp, gives 

/ \ 1.25 / \ -1.25 / T7 \ -2.5 

-3 / Mbh 



/b«5xlO- 



V lO^Mr: 



r, 

10 pc 



Vk 



10^ km s" 



(11) 

which will be useful in what follows. 

Given the elements of the Keplerian orbits after the kick, the 
subsequent evolution of the stellar distribution can be com- 
puted by simply advancing the positions in time via Kepler's 
equation. (Alternately the stellar trajectories can be brute- 
force integrated; both methods were used as a check.) Fig- 
ure |2] shows how the bound population evolves from its ini- 
tially spherical configuration, into a fan-shaped structure at 
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Fig. 3. — Steady state, spherically symmetrized density profiles of the 
bound population for Y= (1, 1.5,2). Dotted lines show the pre-kick densities; 
dashed (blue) lines are Dehnen-model fits. 

t w 10(GMbh/V]j ), and finally into a reflection-symmetric, 
elongated spheroid with major axis in the direction of the kick 
at t sa IQQ{GMbu/vI). The latter time is 



^sym 



' 3 X 10 V 
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-1 



-3 



(12) 



during which interval the SMBH would travel a distance 
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10^ km s" 
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Observing the kick-induced asymmetry would only be possi- 
ble for a short time after the kick; however the elongation of 
the bound cloud at r ^ Hc would persist indefinitely. 

In general, the galactic nucleus might be elongated before 
the kick, and its major axis will be oriented in some random 
direction compared with Vk- Since the stellar distribution at 
r ^ rk is nearly unaffected by the kick, the generic result will 
be a bound population that exhibits a twist in the isophotes at 
r rk and a radially-varying ellipticity. 

Continuing with the same set of approximations made 
above, we can compute the steady-state distribution of the 
bound population by fixing the post-kick elements of the Ke- 
plerian orbits and randomizing the orbital phases (or equiva- 
lently by continuing the integration of Fig.|2]until late times.) 
The resultant density profiles are shown in Figure [3] for y — 
(1, 1.5,2). Beyond a few rk, the spherically-symmetrized den- 
sity falls off as ^ r^"*; the stars in this extended envelope move 
on eccentric orbits that were created by the kick. 

It turns out that Dehnen's (1993) density law: 



(14) 



is a good fit to these density profiles for 1 ^ y ^ 2, if td is set 
to 2.0rk; here Md is the total (stellar) mass. Figure |3] shows 
the Dehnen-model fits as dashed lines. Using the expressions 
in Dehnen (1993), it is easy to show that the Dehnen models 
so normalized satisfy 



(15) 



implying F\ w 2"^^^. This alternate expression for F\ is plotted 
as the dashed line in Figure [T^. Unless otherwise stated, we 
will use equation ([8]) for F\ in what follows. 

So far we have assumed that stars remaining bound to the 
SMBH experience only its point-mass force. In reality, be- 
yond a radius of order nnfi ~ (Vk/c^)^nc, stars will also feel a 
significant acceleration from the combined attraction of the 
other stars, leading to a tidally truncated density profile at 
r ^ rk. We ignore that complication in what follows. 

We note that r, is determined by the density of stars just 
before the massive binary has coalesced, and may be substan- 
tially different from rinfl (eq.[3]i. In the next section we discuss 
predictions for r, based on a number of models for the evolu- 
tion of the massive binary prior to the kick. 

Before doing so, we first present the mass-radius and mass- 
velocity dispersion relations for the bound population, ex- 
pressed in terms of r, as a free parameter. 

2.2. Mass-Radius Relation 
Combining equations ([T]| and we get 



Mb 
Mbh 



3-y 



(16) 



As a measure of the size of the HCSS, the effective radius 
Teff, i.e. the radius containing one-half of the stellar mass in 
projection, is preferable to rk. We define a second form factor 
F2 such that 

reS^F2ijW. (17) 

Figure [TJb) plots f2(y)- Also shown by the dashed line is the 
relation corresponding to the Dehnen-model approximation 
described above, for which 



F2(y)« 1.5 (2^/(3-^) -1) 



-1 



(18) 



(Dehnen 1993). The Dehnen model approximation is reason- 
ably good for all y in the range 0.5 < y < 2.5 and will be used 
as the default definition for F2 in what follows. 

Combining equations (fTSI l and (fTSb gives the mass-radius 
(Mb — Teff) relation for HCSS's, in terms of the (yet unspeci- 
fied) r,: 



Mb=/:(y)MBHrrAfff^ 
K(y) = 11.6y-'-^^ [(3/2) (2 '/t^-^' - 1)- 
fory= (1, 1.5,2), (0.89,1.41,2.32). 



(19a) 
'; (19b) 



2.3. Mass-Velocity Dispersion Relation 

Stars bound to a recoiling SMBH move within the point- 
mass potential of the SMBH, for which the local circular ve- 
locity is (GMbh/'")''^^- The circular velocity at r = rk is just 
Vk, so the characteristic (e.g. rms) speed of stars in the bound 
cloud scales as Vk, motivating us to define a third form factor 
Fi such that 

Oobs=^3(Y)Vk, (20) 

where aobs is the measured velocity dispersion. To the extent 
that y is known, and/or the dependence of on y is weak, it 
follows that the amplitude of the initial kick can be empiri- 
cally determined by measuring the velocity dispersion of the 
stars. 

An integrated spectrum will include stars at all (projected) 
radii within the spectrograph slit. (E.g. at the distance of the 
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Fig. 4. — Line-of-sight distribution of velocities of stars bound to a re- 
coiling SMBH, as seen from a direction perpendicular to the kick. Initially 
p«r"1',Y={l,2); the phase-space distribution following the kick was com- 
puted as in Paper I. Solid curves show N{V) as defined by all bound stars 
(thick) and progressively thinner curves show N{V) defined by bound stars 
within a projected distance of (10, 3. l)ri; from the SMBH. Dashed (blue and 
red) curves show Gaussian distributions with c = (0.2,0.5)Vjs (Y = 1) and 
a = (0.35,0.75)1^ (J= 2) respectively. 

Virgo cluster, a 1" slit corresponds to ^ 80 pc, larger than rgff 
for even the largest HCSS's.) Since V ^ the distribution 
N{V) of line-of-sight velocities of stars within the slit will 
contain significant contributions from stars moving both much 
faster and much slower than Vt and can be significantly non- 
Gaussian'. 

Figure |4] shows N{V) for bound clouds with y = 1 and 2, 
as seen from a direction perpendicular to the kick. (This is 
the a priori most likely direction for observing a prolate ob- 
ject. Since the HCSS is nearly spherical within a few r^, the 
results cited below depend weakly on viewing angle.) Since 
more than 1/2 of the stars lie at r > 2rk and are moving with 
V < Vk, the central core of the distribution has an effective 
width that is much smaller than y^; most of the information 
about the high velocity stars near the SM BH is contained in 
the extended wings (e.g. lvan der Mareri 994). 

Velocity dispersions of stellar systems are typically mea- 
sured by comparing an observed, absorption line spectrum 
with template spectra that have been broadened with Gaus- 

' Integrated spectra of the centers of galaxies typically are well modelled 
via Gaussian broadening functions. This is because most of the light in the 
slit comes from stars that are far from the SMBH. 




X [A] 

Fig. 5. — Absorption line spectrum of the KOIII star HR 7615, convolved 
with two broadening functions. Thick (black) curx'e: N{V) from the top panel 
of Fig.|4] computed from the entire bound population, assuming 14 = 1000 
km s^' . Thin (hhie) curve: Gaussian N{'V) with c = 200 km s^' . 

sian A^(y)'s; the comparison i s either made directly in 
intensity-wavelength space ("e.g. iMorton & Chevalier 1973h 
or via cross-correlation (e.g. 'Simkin 1974'). For example, in- 
ternal velocities of UCDs (ultra-compact dwarf galaxies) in 
the Virgo and Fornax clusters have been determined in both 
ways (e.g. Hilker et al. 2007; Mieske et al. 2008). Figure|5] 
shows the results of broadening the spectrum of a KO star in 
the Call ti-iplet region (8400A< A. < 8800 A), with two broad- 
ening functions: NiV) from the top panel of FigurelH scaled 
to Vk = 1000 km s"', and a Gaussian N{V) with a = 200 km 
s^ ' . The two broadening functions produce similar changes in 
the template spectrum; the NiV) from the bound cloud gener- 
ates more 'peaked' absorption lines, but this difference would 
be difficult to see absent very high quality data. 

We computed the best-fit, Gaussian o corresponding to the 
various broadening functions in Figure |4] as a function of V^. 
The stellar template of Figure|5]was convolved with Gaussian 
A^(y)'s having a in the range 2 to 2000 km s^'and a step 
size of 1 km s^'. Each of the Gaussian-convolved templates 
was then compared with the simulated HCSS spectrum, and 
the "observed" velocity dispersion Gobs was defined as the o 
for which the Gaussian-convolved template was closest, in a 
least-squares sense, to the HCSS spectrum. No noise was 
added to either the HCSS or comparison spectra. 

Figure|6]shows the results for y= (1,2), 150 km s^' < < 
4000 km s^', and for (circular) apertures of various sizes. 
When the entire HCSS is included in the slit, Oobs ~ O.lSVk 
(y = 0.5), « 0.20yk (y = 1), and « 0.35yk (y = 2). These val- 
ues are well fit by the ad hoc relation 



lnF3 = -2.17 + 0.56y, Q.5<,y<,2. 



(21) 



As the aperture is narrowed, Oobs increases to values closer to 
Vk, although as argued above, realistic slits would be expected 
to include essentially the entire HCSS and we will assume this 
in what follows. 

We note that some ultra-compact dwarf galaxies (UCDs) 
have Gobs as large as 40 — 50 km s^'and that the implied 
masses are difficult to reconcile with simple stellar population 
models, which has led to suggestions that the UCDs are dark- 
matter dominated (Hilker et al. 2007; Mieske et al. 2008). 
Alternatively, some UCD's might be bound by a central black 
hole; for instance, an observed o of 50 km s^'is consistent 
with an HCSS produced via a kick of- 250 km s-i(y= 1). 
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Fig. 6. — Velocity dispersions Oobs that would be inferred from broadened 
absoiption-line spectra of HCSS's. Solid (black) lines: 7=1; dashed (blue) 
lines: y = 2, where y is the power-law index of the stellar density profile 
before the kick. Thick curves correspond to all bound stars; thinner curves 
correspond to an observing aperture that includes only bound stars within a 
projected distance lOri^ and 3ri[ from the SMBH (as viewed from a direction 
perpendicular to the kick). Dotted lines show Oobs = 0.2Vk and Oots = 0.35Vi;- 

Detection of the high-velocity wings in N{V) (Fig.|4|l could 
distinguish between these two possibilities. 

W hile spectral deco nvolutio n schemes ex ist that can do this 
(e.g.l Saha& Williams 1994;.Merritt 1997h . they require high 
signal-to-noise ratio data. Precisely how high is suggested 
by Figure |7] which shows the results of simulated recovery 
of HCSS broadening functions from absorption line spectra. 
The spectrum of Figure|5]was convolved with the y = 1 N{V) 
plotted in Figure |4] with — 10^ km s^'. Noise was then 
added to the broadened spectrum (as indicated in the figures 
by the signal-to-noise ratio S/N) and the broadening function 
was recovered via a non-parametric algorithm (Merritt 1997); 
confidence bands were constructed via the bootstrap. Figure]?] 
suggests that S/N« 40 permits a reasonably compelling de- 
termination of a non-Gaussian N{V). This conclusion is re- 
inforced by the inferred values of the Gauss-Hermite (GH) 
moments Oq and h4; the former measures the width of the 
Gaussian term in the GH expansion of N{V) while h4 mea- 
sures symmetric deviations from a Gaussian. For S/N= 40, 
the recovered /z4 — 0.18 ±0.1 (90%), significantly different 
from zero. (We note that the velocity dispersion correspond- 
ing to the GH expansion is Oc = (1 + V6/i4)cJo which is close 
to Oobs as defined above.) In §7 we discuss the feasibility of 
obtaining HCSS spectra with such high S/N. 

Combining equations ( fTTb and ( l20b . the (stellar) mass- 
velocity dispersion (Mb — Oobs) relation for HCSS's becomes 



M, 



BH 







/GMbh 



r. 



3-7 



2(7-3) 
■*obs 



Mq 

(22) 

It is tempting (though only order-of-magnitude correct) to 
write r, w rinfl ~ GMbh/o^, which allows equation (l22l l to be 
written 

Mbh V o y 



N(V) 



N(V) 



N(V) 




-1000 



-500 



500 



1000 



V [km/s] 



Fig. 7. — Recovery of HCSS broadening functions from simulated absorp- 
tion line spectral data with various amounts of added noise. Blue lines are 
the input N{V) (from Figure|4] with 7= 1 and Vj^ = 10^^ km s"'). Solid hnes 
are the recovered N{V)s and dash-dotted lines are 90% confidence bands. Oq 
and /14 are coefficients of the Gauss-Hermite fit to the recovered N{V); 90% 
confidence intervals on the parameters are given. 

The dimensionless coefficient in these two expressions is 
equal to (3 x 10-^1.2x 10-^,0.4) fory= (0.5,1,2). 



3. THE PRE-KICK STELLAR DENSITY 

While the linear extent of a HCSS is determined entirely by 
Mbh and Vk (eq. [U, its luminosity and (stellar) mass depend 
also on the density of stars around the SMBH (i.e. around 
the massive binary) just prior to the kick. In this section we 
discuss likely values for the parameters that determine the pre- 
kick density of stars near the SMBH and the implications for 
the mass that remains bound after the kick. In a following 
section we will relate mass to luminosity and color 

Two inspiralling SMBHs first form a bound pair when their 
separation falls to ~ r^n = GMbh /o, the influence radius of 
the larger hole. This distance is a few parsecs in a galaxy like 
the Milky Way. The separation between the two SMBHs then 
drops very rapidly (on a nuclear crossing time scale) to a frac- 
tion ^ O.IM2/M1 of rrinfl as the binary kicks out stars on inter- 
secting orbits via the gravitational slingshot (Merritt 2006a). 
Because a massive binary tends to lower the density of stars 
or gas around it, the two SMBHs may stall at this separation, 
never coming close enough together ( ^ 10^^ pc) that gravita- 
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tional wave emission can bring them to full coalescence. This 
is the "final parsec problem." 

Of course, in order for a kick to occur, the two SMBHs 
must coalesce, and in a time shorter than ^10 Gyr Roughly 
speaking, this requires that the density of stars or gas near 
the binary remain high until shortly before coalescence. This 
implies, in turn, a relatively large mass in stars that can remain 
bound to the SMBH after the kick, hence a relatively large 
luminosity for the HCSS that results. 

Converting these vague statements into quantitative esti- 
mates of the stellar density just before the kick requires a de- 
tailed model for the joint evolution of stars and gas around the 
shrinking b inary. A number of such rn odels have been dis- 
cussed (see iGualandris & Merritt 2008L for a review). Here 
we focus on the two that are perhaps best understood: 

• Collisional loss-cone repopulation. If the two-body re- 
laxation time fff in the pre-kick nucleus is sufficiently 
short, gravitational scattering between stars can con- 
tinually repopulate orbits that were depleted by the 
massive binary, allowing it to shrink on a timescale 
of ^ tR ( Yu 2002). This process can be accelerated 
if the nucleus contains perturbers that are significantly 
more massive than stars, e.g. giant molecular clouds 
dPerets & Alexander 2008h . Repopulation of depleted 
orbits guarantees that the density of stars near the bi- 
nary will remain relatively high as the binary shrinks. 

• Collisionless loss-cone repopulation. In non- 
axisymmetric (barred, triaxial or amorphous) galaxies, 
some orbits are "centrophilic," passing near the galaxy 
center each crossing time. This can imply feeding 
rates to a central binary as large as M ^ G^'o^ even 
in the absence of collisional loss-cone repopulation 
dMerritt & Poon 2004.) . Because the total mass on 
centrophilic orbits can be ^ Mbh, interaction of the 
binary with a mass ~ Mbh in stars need not imply a 
significiant decrease in the local density of stars, again 
implying a large pre-kick density near the binary. 

We now discuss these two pathways in more detail and their 
implications for the pre-kick stellar density near the SMBH. 

3.1. Collisional loss-cone repopulation 

At the end of the rapid evolutionary phase described above, 
the binary forms a bound pair with semi-major axis 

q nnfl 



(1+^)2 4 



(24) 



with q = M 2/M1 < 1 the binary mass ratio (e.g. 
iMerritt 2006ah . Stars on "loss cone" orbits that intersect 
the binary have already been removed via the gravitational 
slingshot by this time, and continued evolution of the binary 
is determined by the rate at which these orbits are repopulated 
- in this model, via gravitational scattering. Scattering onto 
loss-cone orbits around a central mass Mbh = Mi + M2 
occurs predominantly from stars on eccentric orbits with 
semi-major axes ~ ri^n, and the relevant relaxation time 
is therefore ^ f«(nnfi)- Relaxation times at r = rinfl in 
real galaxies are found to be wel l correlated with spheroid 
luminosities (e.g. Figure 4 of Merr itt et al. 2 007a). dropping 
below 10 Gyr only in low-luminosity spheroids - roughly 
speaking, fainter than the bulge of the Milky Way. Such 
spheroids have velocity dispersions <, 150 km s^' and contain 



Q. 

o O 



T 1 1 ' r 



Fig. 8. — Evolving stellar density around a binary SMBH of mass 
Ml +M2 = 10*M, .i in a spherical galaxy contai ning 10^ Solar-mass stars, in 
the "collisional loss cone repopulation" regime IMerritt e t al. 2007b). Solid 
lines show p(r) at five different times, between a(t) « aj, and fs . The 
density falls to zero at r a(t) and smaller values of a correspond to later 
times; total elapsed time is ~ 0.5?;f(rinfl) where is the gravitational influ- 
ence radius of the massive binary. Dotted line shows the initial (pre-binary) 
galaxy density and dashed line has the Bahcall-Wolf (1976) slope, p « r^'^^. 

SMBHs with masses <^ lO^M©. Binary SMBHs in more 
luminous galaxies might still evolve to coalescence via this 
mechanism, but only if they contain significant populations 
of perturbers more massive than ^ Mq, e.g. giant molecular 
clouds or intermediate mass black holes; Perets & Alexander 
(2008) have argued that this might generically be the case in 
the remnants of gas-rich galaxy mergers though this model is 
unlikely to work in gas-poor, old systems like giant elliptical 
galaxies. 

Denoti ng the semi-major axi s of the massive binary by a{t), 



one finds (IMerritt et al. 2007bh 



fR(nnfl) 



ilnf^ 
V a 



B 



(25) 



for fleq ^ a{t) <, flh, where Oeq ~ lO^^rinfl is the separation at 
which energy losses due to gravitational wave emission be- 
gin to dominate losses due to interaction with stars; (A,B) 
(0.016, 0.08) with only a weak dependence on binary mass ra- 
tio. The elapsed time between a = Oh and a = aeq is of order 

fR(nnfl)- 

Figure [8] shows the evolution of the stellar density around 
a massive binary as it shrinks from a « ah to a « a^q, the 
evolution was computed using the Fokker-Planck formalism 
described in Merritt et al. (2007b). The same gravitational 
encounters that scatter stars into the binary also drive the dis- 
tribution of stellar energies toward the Bahcall-Wolf (1976) 
"zero-flux" form, p ~ r^^/^, and on the same time scale, 
^ ffilnnfl); as a result, a high density of stars is maintained 
at radii a(t) <r< 0.2rinfl. In effect, the inner edge of the cusp 
follows the binary as the binary shrinks. 

Once a{t) drops below ^ Oeq, the binary "breaks free" 
of the stars and evolves rapidly toward coalescence, leav- 
ing behind a phase-space gap corresponding to orbits with 
pericenters Oeq- (In a similar way, evolution of a binary 
SMBH in response to gravitational waves and gas-dynamical 
torques leaves behind a gap in the gaseous accretion disk; 
Milosavljevic & Phinney 2005.) Gravitational scattering will 
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I°gi0 ^^b [^^Sun] 

Fig. 9. — Effective radius vs. bound stellar mass for HCSSs. Thick solid 
lines (blue hatched area) are based on the "coUisional" loss cone repopulation 
model and assume a galaxy central velocity dispersion of c = (50, 100, 150) 
km s^ ' (from left to light). Thin soHd lines (red hatched area) are based on the 
"collisionless" loss cone repopulation model; the three lines in each set as- 
sume a galaxy central velocity dispersion of a = (200, 300, 400) km s^ ' (right 
to left) and the three sets of Hnes are for 7 = 0.5 (black) ,1.0 (green) and 1 .5 
(orange). For both models, solid lines extend to a maximum r^ff based on 
the assumption that > 4.5o (escape from the galaxy) while dashed lines 
correspond to the weaker condition Vk > 2c (escape from the galaxy core). 
HCSSs to the left of the dash-dotted (magenta) line are expected to expand 
appreciably over their lifetime. Data points are from Forbes et al. (2008). 
Filled circles: E galaxies. Open circles: Ultra-compact Dwaii's (UCDs) and 
Dwarf-Globular Transition Objects (DGTOs). Stars: globular clusters. 

only partiall y refill this gap in the t ime between a — Ogq and 
coalescence (iMerritt & Wang 2005h . Figure |8] suggests that 
fleq ^ (flh, nnfl)- Merritt et al. (2007b) estimated, based on the 
same Fokker-Planck model used to construct Figure[8] that 



(0.20,0.67,2.3,7.8) x 10" 



(26) 



"eg 
Hnfl 

for equal-mass binaries with total mass Mj + M2 = 
(10^, 10*', 10^, 1O**)M0; the numbers in parentheses decrease 
by ~ 25% for binaries with M2 /Mi =0.1. 

Following the kick, the density profile of Figure [8] will be 
truncated beyond r w r^. The inner cutoff at r w Ogq satisfies 

^10-3(1^)'. (27) 



rk 



The requirement that Oeq < '"k - i e. that at least some stars re- 
main bound after the kick - then becomes Vt 30a, which is 
never violated by reasonable (VkiO) values. However, the in- 
ner cutoff exceeds O.lr^ for Vt ^ 10a, a condition that would 
be fulfilled for a = 100 km s^'and Vk ^ 10^ km s-\ In 
what follows we ignore the inner cutoff and assume that the 
Bahcall-Wolf cusp extends to r = 0. 
The pre-kick density can therefore be approximated as 



I Po 



-7/4 



^ r <; ro 



(28) 



where ro « 0.2rinfl and po = p(ro) is the density of the galaxy 
core. Using equations (|6]l and ( l28T l, the mass remaining bound 




Fig. 10. — Observed velocity dispersion vs. bound stellar mass for HCSSs. 
Thick solid lines (blue hatched area) are based on the "colHsional" loss cone 
repopulation model and assume a galaxy central velocity dispersion of a = 
(50, 100, 150) km s"'(from left to right). Thin solid lines (red hatched area) 
are based on the "collisionless" loss cone repopulation model; the three lines 
in each set assume a galaxy central velocity dispersion of a = (200, 300, 400) 
km s^' (right to left) and the three sets of lines are for y = 0.5 (black) , 1.0 
(green) and 1.5 (orange). For both models, solid lines extend to a minimum 
Oobs based on the assumption that Vi^ > 4.5a (escape from the galaxy) while 
dashed lines correspond to the weaker condition > 2c (escape from the 

Bsi*h]e'i9»a!ld3ttedsSM@H"af«emiM(H is then 

= ^i(y) 



Mb 
Mbh 



-y 



(29a) 



'1.3 



/GMBHV^VpO'-fnfl 



nflVkV 

5/2 



Vkj \Mbh 



(29b) 



(29c) 



where Mcore = PCinfli the last expression assumes < 
0.2rinfl, i.e. Vk ^ 2a, which is always satisfied for a HCSS 
that escapes the galaxy core. 

To the extent that the galaxy core was itself created by 
the massive binary during its r apid phase o f evolution, then 
Mcore/MBH is of Order unity (Merr itt 2006ah . (Following the 
kick, the core will expand still more; Gualandris & Merritt 
2008.) Making this assumption yields 



Mb 
Mbh 



Vk 



5/2 



(30a) 



:2x 10" 



5/2 



Vk 



-5/2 



(.30b) 



,200 km s"V VlO^kms" 
In the case of ejection from a stellar spheroid like that of the 
Milky Way (a « 100 km s"', Mbh « 4 x IO^'Mq), we have 



Mb: 



IO'^Mq ' 



Vk 



VlO^kms" 



-5/2 



I.e. 



3x10^^ Mb/M0 ;$5xlO'* 
4000 > yk/(km s-') > 500. 



(31) 

(32a) 
(32b) 
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Sersic Index 

Fig. 11. — Escape velocity from the center of a Sersic-law galaxy as a 
function of Sersic index n, in units of the central, projected, Id velocity dis- 
persion as measured through a circular' aperture. Constant mass-to-light ratio 
was assumed and the effect of the SMBH on the potential or on the motions 
of stars was ignored in computing Vesc and c The four curves (black, red, 
green, blue) coiTespond to aperture radii of (0.01,0.03,0.1,0.3) in units of the 
half-light radius of the galaxy. 

Combining equations ([T]), ( fTTb and ( l30b gives the mass- 
radius relation in the collisional regime: 



dGualandris & Merritt 20081) . Figures |9] andfTOl show the Hm- 
its on ^eff and Gobs corresponding to = 5 and N = 2, the 
latter via dashed lines. 

According to Figure |9] HCSSs in this "collisional" regime 
can have effective radii as big as ~ 1 pc when Vk > Vesc; on the 
Mb — Teff plane their distribution barely overlaps with glob- 
ular clusters, and extends to much lower sizes and (stellar) 
masses. However their velocity dispersions (Fig. [TOl i would 
always substantially exceed those of either globular clusters 
or compact galaxies of comparable (stellar) mass. 

3.2. Collisionless loss-cone repopulation 

By "collisionless" we mean that the nuclear relaxation 
time is so long that gravitational scattering can not refill the 
loss cone of a massive binary at a fast enough rate to sig- 
nificantly affect the binary's evolution after the hard-binary 
regime (eq. l24l l has been reached. The relevant radius at 
which to evaluate the relaxation time is ^ r\a^, the influ- 
ence radius of the binary (or of the single black hole that 
subsequently forms). The relaxation time at ria<A in ellip- 
tical galaxies is fou nd to correlate tightly with o or Mbh 
dMerritt et al. 2007bl) : 



on in9 f 

8.0 X 10 yr ' 



V lO^Mp 



1.54 



(37) 



Mb«0.4G-V4M^yV/2rf/4 



; 4 X IQ^Mq 



• 5 X 10%p 



a 



-1/4 



100 km s 



-1 



5/2 



'"eff 



0.1 pc 



(33a) 

\ 5/4 

<$3b) 



100 km s 



-1 



1.29 



>"eff 

0.1 pc 



5/4 



(33c) 



where the final expression assumes the Mbh — relation in 
equation (|2]l. 

The mass-velocity dispersion (Mb — Oobs) relation for HC- 
SSs follows from equation (O with Oobs = ^3 (y) Vk ~ 0.30yk: 



Mb«0.05MBHO^/^o, 



,5/2^-5/2 



obs 



',1 X 10^M,T, 



100 km s" 



7.4 



Oobs 



100 km s" 



(34a) 

-5/2 

(34b) 



where the Mbh — O relation has again been used. 

Figures |9] and [TO] plot the relations ( |33] ). (l34l i for a = 
(50, 100, 150) km s^^ Plotted for comparison are samples 
of globular clusters and dwarf galaxies from the compilation 
of Forbes et al. (2008). 



In these plots, the minimum rgff is presumed to be that 
related with a kick of ^ 4000 km ^ . This condition (c( 



sociated ■ 

bined with the Mbh — O relation) gives 



?-eff ^3.0x lO^^pc 



100 km s 



-1 



4.86 



com- 



(35) 



The maximum reff is associated with the smallest Vk that is of 
physical interest. We express this value of Vk as N<5 which 
allows us to write 



'•eff ^4.9pcA^ 



-2 



100 km s" 



2.86 



(36) 



Kicks large enough to eject a SMBH completely from its 
galaxy have N 5 (Figure \TT[ . Kicks just large enough 
to remove a SMBH from the galaxy core have N ^ 2 



where Solar-mass stars have been assumed. A mass of order 
Mbh is scattered into the central sink in a time ti{{ri„fi), and 
this is also roughly the mass that must interact with the binary 
in order for it to shrink by a factor of order unity. Even al- 
lowing for variance in the phenomenological relations ( [37] l, it 
follows that collisional loss cone refilling is unlikely to sig- 
nificantly affect the evolution of a binary SMBH in galaxies 
with a ;> 200 km s^'orMBH k, IO^Mq. 

An alternative pathway exists for stars in these galax- 
ies to interact with a central binary. If the large-scale 
galaxy potential is non-axisymmetric, a certain fraction of 
the stellar orbits will have filled centers - these are the 
(non-resonant) box or centrophilic orbits, which are typically 
chaotic as well due to th e presence of the central point mass 
dMerritt & Valluri 19991) . Stars on centrophilic orbits pass 
near the central object once per crossing time; the number 
of near-center passages that come within a distance d of the 
central objec t, per unit of time, is found to scale roughly lin- 
early with d (iGerhard & Binnev 1985l;lMerritt & Poon 20 04). 
allowing the rate of supply of stars to a central object to be 
computed simply given the population of centrophilic orbits. 
While the latter is not well known for individual galaxies, sta- 
ble, self-consistent triaxial galaxy models with central black 
holes can b e constructed with ch aotic orbit fractions as large 
as ^ 70% (iPoon & Merritt 200 4*). Placing even a few percent 
of a galaxy's mass on centrophilic orbits is sufficient to bring 
two SMBHs to coalescence in 10 Gyr (iMerritt & Poon 2 004). 
Furthermore, the effect of the binary on the density of stars in 
the galaxy core is likely to be small, since the mass associated 
with centrophilic orbits is ^ Mbh and stars on these orbits 
spend most of their time far from the center 

Here, we make the simple assumption that the observed 
core structure of bright elliptical galaxies is similar to what 
would result from the decay and coalescence of a binary 
SMBH in the collisionless loss-cone repopulation model. In 
other words, we assume that the binary SMBHs that were 
once present in these galaxies did coalesce, and the cores 
that we now see are relics of the binary evolution that pre- 
ceded that coalescence. By making these assumptions, we are 
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probably underestimating the density around a SMBH at the 
time of a kick, since som e observed cores will have b een en- 
larged by the kick itself (iGualandris & Merritt 20081) . Also, 
core sizes in local (spatially resolve d) galaxies are likely to 
reflect a series of past merger events ( Merritt 2006a"); SMBHs 
that recoiled during a previous generation of mergers would 
probably have carried a higher density of stars than implied 
by the current central densities of galaxies. 

Above we characterized the pre-kick mass density as p o= 
r^^, with r, the radius at which the enclosed stellar mass 
equals twice Mbh- We computed y and r, for a subset of ea rly- 
type galaxies in the ACS Virgo sample (ICote et al. 20041) for 
which o was known; for some of these galaxies the SMBH 
mass has been measured dynamically while Mbh in the re- 
maining galaxies was computed from equation (|2|i. Each 
galaxy was mo delled with a PSF-co nvolved, core-Sersic lumi- 
nosity profile jGraham et al. 20()3b . which assumes a power 
law relation between luminosity density and projected radius 
inside a break radius R\,. The core-Sersic fits were numeri- 
cally deprojected, and converted from a luminosity to a mass 
density as in Ferrarese et al. (2006). The radius r, was then 
computed from 



o 
o 



3-yMj 



i/(3-y) 



BH 



PO'-Q 



with y the central power-law index of the deprojected density; 
ro is a fiducial radius smaller than /?b which we chose to be 1 
pc and po is the mass density at r = ro. 

Figure[T2]shows the relation between r, and a and between 
r, and Mbh- "Core" galaxies (those with projected profiles 
flatter than L R^^-^ near the center) are plotted with open 
circles and "power-law" galaxies (E steeper than R^'^'^) galax- 
ies as filled circles. While this distinction is somewhat arbi- 
trary. Figure [T2] confirms that the "core" galaxies have larger 
r, at given o or Mbh than the "power-law" galaxies, consis- 
tent with the idea that the central densities of "core" galaxies 
have been most strongly affected by mergers. The best-fit re- 
lations defined by the core galaxies alone are 

logio('-./pc) = -4.84 + 2.84 logio(o/km s"') (39a) 
= -2.92 + 0.56 logio(MBH/M0) (39b) 



I.e. 



r, « 50 pc 

«35 pc 



200 km s"^ 

/ Mbh - 
\ IO^Mq 



(40a) 
(40b) 



While these relations are fairly tight, the y values show some- 
what more scatter, in the range 0.5 ^ y ^ 1 .5, and we leave y 

as a free parameter in what follows. 

Combining the relations ( l40l ) with equations ( I19bb and ^ 
gives a mass-radius relation for HCSSs in the "collisionless" 
paradigm: 



Mb 



^Gm(y) 



200 km s 

Mbh 
IO^Mq 



2.84Y-3.66 



feff 

0.1 pc 



3-7 

(4M) 



0.5607-0.680 



^eff 

0.1 pc 



3-r 



(41b) 
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Fig. 1 2. — Relation between r., the radius containing a mass in stars equal 
to twice Mbh. and central velocity dispersion {top) or black hole mass {bot- 
tom), for galaxies in the ACS Virgo sample. Open circles are "core" galaxies; 
dashed lines are fit to just these points while dotted lines are fit to the entire 
sanjple. 

wTiere 



G,,(y) = 1.93x loV'" 



750. 



Gm(y) = 1.16x lOY 



,5„-1.75 



21/(3-7) -1 
525. 

21/(3-7) -1 
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(42a) 



(42b) 



Similarly, combining equations ( l40l i with equation ( |22] | 
gives the mass-velocity dispersion relation: 



Mb 
lO^Mp 



where 



^^^s(y) 



^^m(y) 



200 km s"^ 
Mbh ^ 



10.92-2.02y / r> \ 2(7-3) 

' ^""^ ^ (43a) 



lO^Mo , 
i/s(y) = 1.92x loV''^^^ 



VlOOkms"' 

2(7-3} 



Oobs 



100 km s" 



(43b) 



]2(3-7)^ 
12(3-7) 



(44a) 
(44b) 



[1.2F3( 

//,n(y) = 1.16x 10V-'^[1.1F3( 

and 5(y) is given by equation (ITTT i. 

These relations are plotted in Figures|9]and[T0] The allowed 
locus in the reff — Mb diagram (indicated by red vertical lines) 
now includes the region occupied also by globular clusters 
and compact E galaxies. However, velocity dispersions re- 
main much higher than those observed so far in these classes 
of object. 
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3.3. Unbound stars 

The high, pre-kick stellar densities near the binary which 
are required for coalescence in the stellar-dynamical mod- 
els do not necessarily imply that these stars are bound to 
the SMBH. For instance, in a triaxial galaxy populated by 
radially-anisotropic box orbits, some of the stars that are mo- 
mentarily near SMBH will be on orbits that make them un- 
bound with respect to the hole, even before the kick. Another 
example is loss-cone repopulation by "massive perturbers"; in 
this model stars are "shot" inward to the binary on eccentric 
orbits, many with high enough velocities that they would be 
unbound in the absence of the galactic potential. 

When deriving the velocity distribution of stars near the 
SMBH, we assumed isotropy and we neglected the effect of 
the galaxy's gravitational potential on the stellar orbits. These 
assumptions would be violated in some (though not all) of the 
loss-cone repopulation mechanisms that have been invoked to 
solve the stalling problem. Here we discuss briefly the conse- 
quences of relaxing these assumptions. 

We first note that in the isotropic case, unbound stars are 
negligibly important. We verified this by constructing fully 
self-consistent models of galaxies containing SMBHs and 
counting the unbound stars at each radius. We used (isotropic) 
Dehnen (1993) models, which have an inner power-law den- 
sity profile; the self-consistent distribution function describ- 
ing the stars was computed assuming a central point with mass 
A^BH = 0.002Mgai. We confirmed, fory^ 1 and Y= 2, that the 
fraction of the mass within that is unbound for kick veloc- 
ities exceeding the escape velocity is never more than a few 
percent. 

If the pre-kick velocity distribution were radially 
anisotropic, more stars would be unbound with respect 
to the massive binary and the post-kick bound population 
would be smaller than what was computed in §2. While this 
is certainly possible, we note that the anisotropy would have 
to be appreciable at very small radii, a tenth or hundredth of 
the SMBH influence radius, in order to substantially affect 
the fraction of the population that is bound after the kick. 

For Y < 0.5, there is no isotropic distribution function that 
can reproduce the density near the SMBH (or rather, the 
isotropic distribution function would be negative at highly- 
bound energies). For such flat cores, one would need to as- 
sume a tangentially anisotropic velocity distribution. This 
would have the effect of increasing the mass of the post-kick 
bound population, since it would increase the number of stars 
on low-velocity (nearly circular) orbits at every radius. 

These uncertainties deserve to be more completely ad- 
dressed in a future paper. For now, we prefer to encapsulate 
them all in the value of y. 

3.4. Other pathways to coalescence and/or ejection 

Recoils large enough to eject SMBHs completely from 
galaxies can occur even in the absence of gravita- 
tional waves, via Newtonian interactions involvin g three 
or more massive objec ts (e.g. |Mikkola & Valtonen 1990t : 
iHoffman and Loeb 20061) . The same interactions can also 
hasten coalescence of a binary SMBH by inducing changes 
in its orbital eccentricity. If the infalling SMBH is less mas- 
sive than either of the components of the pre-existing binary, 
M3 < (Mi,M2), the ultimate outcome is likely to be ejection 
of the smaller hole and recoil of the binary, with the binary 
eventually returning to the galaxy center. If M3 > Mi or 
M3 > M2, there will most often be an exchange interaction, 
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log (r/r^) log t [yr] 

Fig. 13. — Evolution of the density around two HCSSs due to resonant 
scattering of stars into the SMBH's tidal disruption sphere. Top: Mbh = 
3 X IO'^Mq, Mb ~ 7 X IO'M:,,; bottom: Mbh = 3 x IO'M,;,, Mb ~ 1 x lO^Mg; 
Vk = 10^ km s"' in both cases. Left panels show the stellar density at Gyr 
time increments; the density drops as stars are lost into the SMBH. Right 
panels show N. 

with the lightest SMBH ejected and the two most massive 
SMBHs forming a binary; further interactions then proceed 
as in the case M3 < (Mi,M2). Whether, and to what extent, 
the ejected SMBH constitutes a HCSS depends on whether it 
carries a bound population and can retain it during interaction 
with the other SMBHs. These questions are amenable to high- 
accuracy A^-body simulations, which we hope to carry out in 
the future. 

The presence of significant amounts of cold gas in galaxy 
nuclei can also accelerate the evolution of a binary SMBH. 
However it is not clear whether the net effect of gas would be 
to increase, or decrease, the mass of a bound stellar popula- 
tion around the coalesced binary, compared with the purely 
stellar dynamical estimates made here. On the one hand, gas 
dynamical torq ues can lead to rapi d formation a tightly-bound 
binary SMBH dMaver et al. 2007h . reducing the time that the 
binary can deplete the stellar density in the core on scales of 
the SMBH influence radii. On the other hand, the formation 
of a steep Bahcall-Wolf cusp around a shrinking binary dis- 
cussed in §3 requires that the binary evolution timescale be of 
order the nuclear relaxation time. Cold gas also implies star 
formation, which could increase the number of bound stars. 

4. POST-KICK DYNAMICAL EVOLUTION 

In the colHsional regime, Mbh ^ lO^M©, a HCSS will con- 
tinue to evolve via two-body relaxation after it departs the nu- 
cleus. We argued above that the density profile around the 
SMBH will be close to the "collisionally relaxed" Bahcall- 
Wolf form, p r^^/^, at the time of the kick. After the kick, 
the Bahcall-Wolf cusp is steeply truncated at r > rt, with 
'"k ^ '"infl- Gravitational encounters will continue to drive a 
flux of stars into the tidal disruption sphere of the recoiling 
SMBH, but because there is no longer a source of stars at 
r ~ /"infl to replace those that are being lost, the density at 
r < will steadily drop, at a rate that is determined by the 
tidal destruction rate. The latter is roughly (Paper I) 

^^r^A-]h (45) 
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stars per unit time, where rt is the tidal disruption radius. 
Equation (|45] | is the so-called "resonant relaxation" loss rate 
jRauch and Tremaine 199 6*) and it differs by a factor ^ ' 
from the standard, non-resonant relaxation rate. In the case of 
SMBHs embedded in nuclei, most of the disrupted stars come 
from radii r « rinfl where resonant relaxation is not effective; 
once these stars have been removed by the kick, the stars that 
remain are almost all in the resonant regime and equation ( |45] | 
is appropriate. 

Using equation (|45] |. the condition that significant loss of 
stars take place in 10^ yr or less, i.e. 



becomes 



\_dN_ 
Vk 



10%! 



m s 



-1 



3/2 



(46) 



(47) 



This can be recast as a relation between Mb and rgff using the 
equations in the previous sections; the resulting line is plotted 
as the magenta dot-dashed curve in Figure |9] HCSSs to the 
left of this line (in the collisional regime only) should expand 
appreciably on Gyr timescales. 

To simulate the evolution of a HCSS in this regime, we 
solved the orbit-averaged isotropic Fokker-Planck equation 
for stars moving in the point-mass potential of a SMBH. In its 
standard form, based on the non-resonant angular-momentum 
diffusion coefficients, the Fokker-Planck equation would pre- 
dict evolution rates that are orders of magnitude too small. In- 
stead we used an approximate resonant diffusion coefficient 
as in Hopman & Alexander (2006). The amplitude of this 
diffusion coefficient is not known from first principles and 
we chose it to approximately reproduce the A^-body diffusion 
rates observed in Paper I and Harfst et al. (2008). Unlike 
in most applications of the Fokker-Planck equation, the outer 
boundary condition in our case is f{E = 0) = 0, i.e. the den- 
sity of stars falls to zero far from the SMBH (in addition to 
being zero near the tidal disruption sphere). 

Figure [13] shows the evolution over 10 Gyr for two HC- 
SSs with Mbh = (3 x 10*^,3 x 1O^)M0 and Vj, = 10^ km 
s^^ The first cluster lies to the left of the magenta dot- 
dashed line in Figure |9] and the second lies to the right. 
Tidal disruption rates are initially similar for the two clusters, 
lO^^yr^ ^ ^ lO^^yr^ ' , but the resultant density evolution 
is much greater in the smaller HCSS since its initial (stellar) 
mass (^ lO^^M.Tj) is less. The stellar disruption rates in Fig- 
ure [13] and their change with time, are similar to estimates 
made in a simpler way by Paper 1. 

The expansion seen in Figure[T3]is only significant for HC- 
SSs that are older than a few Gyr and that populate the left- 
most part of the mass-radius plane (Figure [9]). Furthermore, 
the theory of resonant-relaxation-driven evolution of star clus- 
ters is still in a fairly primitive form and the true evolution is 
likely to be affected in important ways by mass segregation 
and other effects that have so far hardly been studied. For 
these reasons we chose to ignore the expansion in what fol- 
lows; we note here only that the lowest-mass HCSSs are most 
likely to be affected by the expansion. We hope to return to 
this topic in more detail in later papers. 

5. LUMINOSITIES AND COLORS 

Given the total stellar mass in the cluster, we calculate the 
luminosity and color of the object using the tabulated stellar 
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Fig. 14. — Color-magnitude diagram of stellar cluster evolution tracks for 
a range of metallicities and total stellar mass of lO^M... The clusters evolve 
from the upper-left to the lower-right, with open squares corresponding to 
ages of 10^ 10', and 10'° years. 

evolution tracks of Girardi et al. (1996). These data give ab- 
solute magnitudes in [U,B,V,R,I,J,H,K] bands for a particular 
stellar (birth) mass, age, and metallicity. We assume that all 
the stars in the cluster have the same age, and the initial mass 
function (IMF ) is given by a broken power-law distribution 
(iKroupa 20011) : 
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M > 0.5Mq 



(48) 



At any given time, the total luminosity of a stellar cluster is 
generally dominated by red giants, yet the additional contri- 
butions from the main sequence stars shifts the clusters bluer 
with respect to individual giants. For example, this means that 
for a given B K color, a cluster will have a smaller value of 
B — V color than an individual giant with the same B K. 
Thus unresolved HCSSs may be initially distinguished from 
foreground stars by doing a simple cut in color-color space. 
Figure[T4]shows stellar cluster evolution on a color-magnitude 
plot, with age progressing from the upper-left (10^ years) to 
the lower-right (10 years). Here we have fixed the total clus- 
ter mass at IO^Mq, comparable to a typical GC. 

If the HCSSs have higher metallicities than typical GCs (a 
reasonable assumption if the HCSSs are simply displaced nu- 
clei), we expect them to have a significantly wider range of 
colors, which may be useful in selecting target objects photo- 
metrically from a wide field of view. In particular, since the 
recoiled star clusters are expected to be quite old, they should 
be particularly red compared to GCs of similar ages. 

Given the mass-luminosity relation of any individual 
HCSS, we can now estimate the luminosity distribution func- 
tion for a large number of sources. To arrive at an ob- 
served source count, we must first begin by calculating the 
formation rates of HCSSs via SMBH mergers. Since the 
lifetime of HCSSs is essentially the Hubble time, we need 
to integrate the cosmological merger history of the universe 
beginning at large z (> 8) up until today. While these 
merger rates are uncertain within at least an order of mag- 
nitude, most estimates share the same qualitative behavior, 
with the merger rat es as observed to d ay peaking around 
redshift z » 2 - 3 ( Menou et al. 20011: ISesana et al. 20041; 
iRhook & Whyithe 2005) and totalling ~ 10 mergers per year 
(as measured by an observer at z = 0) for Mbh > lO^M©. 
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Fig . 1 5 . — Luminosity distribution function of HCSSs per comoving Mpc' . 
For the collisionless case witli Mbh lO'A/rj, tlie curves correspond to y = 
0.5,1.0,1.5,2.0: {solid, dotted, dashed, dot-dashed). 
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Fig. 16. — Distribution of observed velocity dispersions a^bs for HCSSs 
with total luminosity L > IO^'L j. For the collisionless case with Mbh 
lO^M. , the curves correspond to •/ = 0.5, 1.0, 1.5,2.0: {solid, dotted, dashed, 
dot-dashed). 

We follow the results of Sesana et al. (2004) in estimat- 
ing merger rates as a function of total black hole mass and 
redshift. In practice, this entails defining an ad-hoc mass dis- 
tribution function of merging SMBHs with the form 



4>(M,z)^/(M)g(z), 



(49) 



where f{M) and g{z) are constructed to match the results of 
Figure 1 from Sesana et al. (2004). They find that, at any 
given redshift, the merger rate scales roughly as M^^/^ per 
log mass. This corresponds to a functional form of f{M) ^ 
M^^l'^, and is well-described by a polynomial with an ex- 
ponential cutoff at large z- Then the rate of observed mergers 
with total mass M = M\ +M2 is given by 

R{M,z)^ J^{Mi,z)^{M2,z)dMi. (50) 

To model the merger history of the universe, we follow the 
same approach as in Schnittman & Krolik (2008), integrating 
forward in time from redshift z = 8 (using a standard ACDM 
cosmology with i2m = 0.3, Q.a ~ 0.7, and h = 0.72), and 
at each redshift, generate a Monte Carlo sample of merger 
pairs, each weighted appropriately from the distribution func- 
tion <t>(M,z). We then normalize the total merger rates to re- 
produce the results of Sesana et al. (2004), as observed to- 
day. In the Monte Carlo samphng, we constrain the selected 




Fig . 1 7 . — Number density of HCSSs with a given luminosity and observed 
velocity dispersion, in units oi d~N/{d log Ldloga) per Mpc^^, for an age at 
formation of 10* years. The contour lines (from left to right) correspond to 
values of logW = (-2, -3, -4, -5,-6, -7). 

SMBHs to have a mass ratio q greater than 10^^, motivated 
by the dynamical friction timescale for the tidal stripping of 
the satellite to be less than a Hubble time. In any case, the rate 
at which HCSSs are ejected from the galaxy is not dependent 
on the precise value of the mass-ratio cutoff, since at mass 
ratios smaller than ^0.1, there is little appreciable kick. 

For each merger, if the resulting SMBH recoil is large 
enough to escape from the host galaxy, we consider it to form 
a HCSS. For a given mass ratio, the kick velocity is calcu- 
lated using equations (1-4) from Baker et al. (2008), assum- 
ing spin magnitudes in the range 0.5 < a 1,2 /Mi. 2 < 10, and 
spin orientations with a random uniform distribution. These 
assumptions are reasonable if SMBHs gain most of their 
mass through accretion (thus a relatively large spin parameter) 
and come together through dynamical friction after their host 
galaxies merge (thus random spin orientations). As pointed 
out by Bogdanovic et al. (2007), gas-rich or "wet" merg- 
ers may result in rapid alignment of the two SMBH spins, 
producing significantly smaller recoils. On the other hand, 
"dry" mergers should al low the SMBHs to retain their origi- 
nal random orientations (Schnittman 2004). However, even in 
wet mergers, a circumbinary disk may form and drive the two 
SMBHs together via gas-dynamical torque without very much 
direct accretion onto either SMBH, and therefore remain rel- 
atively dry, with correspondingly large kicks. 

6. RATES OF PRODUCTION 

From Figure [TT] we estimate that the escape velocity is 
roughly five times the nuclear stellar velocity dispersion o, 
where a is determined from the total SMBH mass from equa- 
tion (in above. If the final SMBH does escape, it will carry 
along a total mass Mb in bound stars, determined by equa- 
tions ( l34b for coUisional relaxation (Mbh ^ lO^M©) and ( l43T l 
for collisionless relaxation (Mbh lO^Mp). Neglecting mass 
loss from stellar winds and tidal disruptions, the total cluster 
mass in stars should remain roughly constant over a Hubble 
time. We used the stellar evolution tables from Girardi et al. 
(1996) to calculate the colors and luminosity of each ejected 
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(V-I)o [mag] 

Fig . 18. — Number counts of HCSSs with a given (V — I) color and abso- 
lute visual magnitude My, in units of d-N/ (dM y d{V — /)) per Mpc', with 
the same shading and contour line values as Fig. 1171 Top: star formation at 

— 0.1 Gyr; bottom: star formation at fj^ — 5 Gyr Filled circles are UCD's 
from Evstigneeva et al. (2008). Open circles are DGTO's from Hasegan et al. 
(2005). Triangles are E-galaxy nuclei from Cote et al. (2006); open triangles 
are nuclei brighter than = 13.5 and filled triangles are nuclei fainter than 
Bt = 13.5. Stars are Milky Way GCs from Harris (1996) and points are Virgo 
cluster GCs from Mieske et al. (2006). 

HCSS for two cluster ages: (1) the stars in the cluster were 
formed 10** years before SMBH merger and ejection, and (2) 
the stars formed 5 x 10^ years before ejection. We carried 
out this exercise both with a single (solar) metallicity for all 
HCSSs, and also assuming a Gaussian distribution of metal- 
licities: 
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[mag] 

Fig. 19. — Number counts of HCSSs with a given effective radius r^fi 
and absolute S'-magnitude Mk, in un its o f d^N / {dXogr^ffdMf:) per Mpc^ . 
Contour levels are the same as in Fig. [17] Data points are from Forbes et al. 
(2008). Filled circles: E galaxies. Open circles: UCDs and DGTOs. Stars: 
globular clusters. 




[mag] 

Fig. 20. — Number counts of HCSSs with a given velocity dispersion 0obs 
and absolute A'-magnitude Mk, in units of d^N /(d log gpbs dMic ) per Mpc^^ . 
Contour levels and data points are the same as in Fig. ll7l 

which correspond approximately to what Cote et al. (2006) 
inferred as the metallicity distribution of nuclear star clusters 
in the Virgo cluster galaxies, assuming ages of 5 Gyr We ob- 
tained very similar results under the two assumptions, in part 
because the Girardi et al. (1996) tables only extend to slightly 
super-solar metallicities (Z = 0.03). Figures [TsUTtI assume 
solar metallicity while Figures[T8]-l20l assume the metallicity 
distribution of equation ( fSTT l. 

Figure [15] shows the luminosity distribution function (num- 
ber per comoving Mpc^) for a range of y, with a stellar age 
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of 10 yr at time of SMBH recoil. The high-luminosity sys- 
tems correspond to high-mass SMBHs that merge via colli- 
sionless relaxation and thus the number of bound stars is di- 
rectly a function of the parameter y. In Figure [16] we plot 
the distribution as a function of observed velocity dispersion, 
limited only to systems with L > lO^L©. The low-velocity 
cutoff is directly a function of the minimum SMBH mass 
needed to keep roughly lO^M© in bound stars; any galactic 
halo with SMBH mass above ~ lO^M© will have an escape 
velocity > 400 km/s, and thus an observed velocity disper- 
sion Oobs ^ 120 km/s. The high-velocity cutoff is defined by 
the maximum kick velocity of V^ick ^ 4000 km/s, as deter- 
mined by numerical simulations of BH mergers. While the 
majority of HCSSs will have small velocity dispersions, the 
brightest, most massive ones will likely come from the most 
massive host galaxies, and thus require the largest kicks, in 
turn giving the highest internal velocity dispersions. In this 
regard, HCSSs behave similarly to more classical stellar sys- 
tems: higher masses have higher dispersion. However, hold- 
ing the SMBH mass fixed, a larger recoil velocity will result 
in a smaller number of bound stars [eqns. ( l34l l43]ll. and thus 
lower luminosity for higher velocity dispersion. 

In Figure [17] we show a contour plot of the density of HC- 
SSs as a function of luminosity and observed velocity dis- 
persion. Here we clearly see that the most luminous systems 
will also have the largest dispersion, roughly an order of mag- 
nitude greater than any globular cluster of the same stellar 
mass. We can also use Figure [TT] to estimate the number of 
HCSSs that might be observable in the local universe. As- 
suming a uniform spatial distribution in the local universe, 
out to a distance of 20 Mpc (--30,000 Mpc^), for y = 1 we 
should expect to see dozens of objects with L > lO'^L© and 
at least a few with L > lO^L©. However, an all-sky survey 
to find these few innocuous objects could be prohibitively ex- 
pensive. Coincidentally, the total mass in the Virgo cluster 
out to a radius of ~ 2 Mpc is roughly the same as that of a 
smooth universe o ut to ^ 20 Mpc, whi ch is approximately the 
distance to Virgo ("Fouqu e et al. 200T1) . In other words, a fo- 
cused survey of Virgo would be able to sample an effective 
volume of ~ 30,000 Mpc^ all at the same distance and with a 
relatively small field of view. 

We expect to find ^ 1 HCSS in Virgo with mk <20; ^ 6 
with niK < 22; ^ 40 with niK < 24; and ^ 150 with niK < 26, 
almost all of which would have Oobs ^ 200 km s^'. For the 
Fornax cluster, which is at roughly th e same distance, but con - 
tains less mass by a factor of ~ 15 (iDrinkwater et al. 200 lb . 
the source counts at the same fluxes should be down by a com- 
parable factor The Coma cluster, on the oth er hand, has a 
mass comparable to Virgo (iKubo et al. 2007h . but at a dis- 
tance of ^ 100 Mpc, the apparent brightness of any HCSSs 
will be smaller by ^ 4 magnitudes. 

Figures [18] [19] and [20] show predicted number counts in 
the color-magnitude, size-magnitude, and velocity dispersion- 
magnitude planes. Over-plotted for comparison are data for 
other compact stellar systems, from various sources, as dis- 
cussed in the figure captions. In constructing these figures (as 
well as Figure [iTli. we set y = 1 in the "collisionless" regime. 
This resulted in a slight bimodality in the distributions cor- 
responding to the discontinuous change in y from 1 .75 to 1 
at galaxy masses of ^ IO^'Mq. Alternatively, one could al- 
low y to vary in some smooth way with galaxy mass (cf. the 
discussion in §3.2). 

7. IDENTIFYING HCSSs 



7.1. Search Strategies 

We have shown that stars bound to a recoiling SMBH would 
appear as very compact stellar clusters with exceptionally 
high velocity dispersions. The density of HCSSs, and there- 
fore the chances of finding them, will be highest in clusters 
of galaxies, and nearby galaxy clusters like Virgo and Fornax 
are therefore well suited to searching for HCSSs. 

In their properties, HCSSs share similarities with globular 
clusters (GCs). However, they would differ from classical 
GCs by their much larger velocity dispersions and (possibly) 
higher metal abundances (the nuclei of elliptical galaxies in 
the Virgo and Coma clusters often have metal abundances 
comparable with solar, and the cores of quasars, powered 
by major mergers, frequently show super-solar metallicities). 
They would differ from stripped galactic nuclei and ultracom- 
pact dwarf galaxies (UCDs, Phillips et al. 2001) by their typ- 
ically greater compactness (Figure [T9] l. They would differ 
from objects in their local environment by possibly showing 
a large velocity offset. 

How can we find HCSSs and distinguish them from other 
source populations? Systematic searches would be based on 
color, compactness, spectral properties, or combinations of 
these. 

Imaging searches for compact stellar systems have been or 
are currently being carried out, focussin g especially on the 
nearest clusters of galaxies Fornax (e.g. 'H ilker et al. 19991: 
I Drinkw ater et al. 2003; Mieske et al. 2008) , Virgo (e.g . 
'Cote et al. 2004; Hasegan et al. 2005; Mieske et al. 20061: 
Firth etal. 2008), and Coma (Carter etal. 2008), and on a 
number of nearby groups (Evstig neeva et al. 20071) . These 
studies have resulted in the detection of a large number of 
GCs, and of several UCDs per galaxy cluster. 

Two strategies suggest themselves for identifying HCSSs 
among existing surveys of compact stellar systems in nearby 
galaxy clusters. One would focus on the faintest HCSSs 
which are most abundant; the other would concentrate on the 
brightest objects, which are rare, but most amenable to follow- 
up spectroscopic observations. 

According to Figure[T9] HCSSs separate in reff— luminosity 
space most strongly at the smallest effective radii, smaller 
than Galactic GCs, while Figure [18] suggests that their colors 
should be comparable to (metal rich) GCs or (gas-poor, i.e. 
non-star-forming, and non-accreting) galactic nuclei. PSF- 
deconvolved HST imaging can achieve a spatial resolution 
better than 0.1 arcsec, corresponding to spatial scales of ^ 10 
pc at the distance of the Virgo cluster However, in order 
to confirm such HCSS candidates, a laborious spectroscopic 
multi-fiber follow-up survey would then have to be carried 
out. 

Instead, selecting brighter (even though rarer) objects ap- 
pears more promising and would substantially reduce the ex- 
posure time. A key signature of a HCSS is its large veloc- 
ity dispersion, which would distinguish it from luminous GCs 
and most known UCDs. At the same time, the highest velocity 
dispersions would tend to put the broadened absorption lines 
below the noise. Therefore, HCSSs with Oobs below several 
hundred km s^' might be easiest to detect. These are in fact 
expected to be the most common (Figure [T6]l. 

In order to estimate exposure times, we simulated spectra 
with the m ulti-object spectrograph FLAMES attached to the 
VLT (Pas quini et al. 2002 ). using the spectrograph GIRAFFE 
in MEDUSA mode. This spectrograph allows the observation 
of up to 130 targets at a time at intermediate (~ 30 km s^') to 
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high 10 km ) spectral resolution. The simulated spec- 
tral deconvolutions in §2 suggest that a signal-to-noise (S/N) 
ratio of ~ 10 is sufficient to detect the broadened lines, while 
S/N w 30 is desirable in order to probe the non-Gaussianity 
of the broadening function. In order to reach a S/N of 30 for 
a cluster of my w 21 or fainter (My ^ — 10 at Virgo) requires 
excessive integration times in the high resolution mode. In 
lower resolution mode, about 10 hours exposure time are re- 
quired to reach S/N=30 for my = 20. Simply detecting the 
high velocity dispersion requires less time, of order an hour 
or less for OTv ^ 21. 

7.2. Could UCDs be HCSSs? 

HCSSs share some properties with UCDs and dwarf- 
globular transition objects ("DGTOs"; Hasegan et al. 2005). 
These are compact stellar systems with stellar velocity disper- 
sions as high as ~50 km s^\ masses bet ween 10^~^ Mp, an d 
unusually high mass-to-light ratios (e.g. iHilker et al. 20081) . 
Figures |9] and [T9l suggest that UCD sizes are consistent with 
those of the largest ("collisionless") HCSSs, although Fig- 
ures [TO] and |20] suggest that known UCD velocity dispersions 
are too low by a factor of at least a few. Furthermore Fig- 
ures [19] and |20] suggest that HCSSs with properties simi- 
lar to those of UCDs are likely to be rare. However, there 
is increasing evidence that UCDs are a "mixed bag" (e.g., 
Hilker 2006), possibly requiring a number of different forma- 
tios mechanism s (Ohetal. 1995; Fellhauer & Kroupa 2002; 
Bekki et al. 2003; .Mieske et al. 2004; .Martini & Ho 20o4 
Goerdt et al. 20q1). Individual HCSSs might therefore 
hide among the UCD population, and low-mass UCD and 
DGTO candidates identified in current and future surveys 
might some times be recoiling HCSSs. Objects like Z2109 
(IZepf et al. 2008 ) with its very unusual and broad [OIII] emis- 
sion line are also possible candidates. 

7.3. Very young HCSSs? 

The predicted HCSS colors and luminosities summarized 
in Figures [18] -|20] reflect the properties of the old (^ 5 Gyr) 
stellar populations that predominate in our models. A HCSS 
that was discovered near its birthplace might appear much 
younger than implied by these plots, particularly if it was born 
in a starburst galaxy resulting from a merger. The light from 
such a HCSS might be dominated by young massive stars, at 
least during the ~ 0. 1 — 1 Gyr required for it to exit the galaxy. 
(We are assuming here that massive stars can form, or at least 
quickly find their way, well inside the SMBH influence radius, 
as appears to be the case at the center of the Milky Way. We 
are further assuming that the recoiling SMBH is not accreting 
at the time of observation since otherwise the quasar might 
outshine the starlight.) We implicitly excluded this possibility 
above by assuming 0.1 Gyr as a minimum lag between star 
formation and SMBH ejection. 

Starting roughly 6 Myr after a starburst episode, luminosi- 
ties and colors of the burst population are expected to increase 
rapidly due to red supergiant stars (RSGs), which begin to 
appear when main sequence stars of initial mass ^ 25M0 
finish core-hydrogen burning. The absolute magnitude of a 
burst population with standard initial mass function spikes 
roughly ^ 10 Myr after the burst due to the RSGs; the in- 
crease is greatest in the IR bands. For example, in the K 
band, integrated luminosity increases to a value 100 times 
greater than for the same population at 1 Gyr (Leitherer et 
al. 1999) before falhng off rapidly after 14 Myr Inte- 
grated colors reach their reddest values at the same time, i.e. 



V-I sa 1.5. Single RSG stars can approach luminosities of 
10^ — lO^Lpj (Davies et al. 2007), so the luminosity and color 
of an HCSS containing ^ 10^ stars could undergo enormous 
relative changes around this time. 

There are both advantages and disadvantages to targetting 
such young HCSSs as a search strategy. (Super)star clusters 
are commonly found in the centers of starburst galaxies an d 
merger remnants (e.g. lMax et a l. 2005V Galliano et al. 2 008*). 
so in any search based solely on imaging photometry, HCSSs 
could not be easily distinguished. However, once a HCSS 
reaches larger separations from the center of the galaxy, it 
will already deviate from its surroundings by plausibly show- 
ing higher metal abundance. On the other hand, even low- 
resolution spectroscopy could immediately reveal a high ve- 
locity relative to the galaxy core or relative to other clusters in 
the galaxy. 

We note that discovery of a young HCSS that is still 
uniquely associated with its host galaxy can provide important 
constraints on the time scale associated with late evolution of 
the binary SMBH that engendered the kick. If coalescence 
was rapid, the host galaxy of the HCSS will still show signs of 
recent interaction (for instance tidal tails). If the massive bi- 
nary stalled for ^ 10** — 10^ yr before coalescing, these signs 
will be mostly gone. Age dating of the recoil event could be 
based on (projected) distance from the galaxy core and recoil 
velocity, and/or on the age of the stellar populations. 

7.4. Exotic manifestations 

HCSSs with accreting central black holes, powered either 
by stellar tidal disruptions or stellar mass loss, were discussed 
in Paper I. This sub-population could be efficiently searched 
for by combining optical properties with information from 
multi-wavelength surveys, e.g., in the X-ray, UV or radio 
band. We s peculate that th e unusual optical transient source 
SCP06F6 ( Barbarv et al. 2 008) might be a tidally-detonated 
white dwarf bound to a recoiling SMBH (as discussed al- 
ready in Paper I). This scenario would fit the high amplitude 
of variability of the transient, the absence of an obvious host 
galaxy, and the possible association with a cluster of galaxies 
at z = 1.1. However, the observ ed symmetry of the lightcurve 
might suggest a lensing origin (Barbary et al. 2008). The un- 
usual optical spectrum of this source could be caused by the 
tidal-debris disk illuminating the outer disk or the outflowing 
part of the detonation debris. That way, the observed extreme, 
unusally broadened absorption features would be caused if we 
are looking down-stream. Gaensicke et al. (2008) recently re- 
ported the detection of an X-ray source co-incident with SCP 
06F6 with an X-ray l uminosity at the lowe r end of known 
tidal disruption flares dKomossa et al. 20041) . These authors 
discuss supernoave-related scenarios but also consider tidal 
disruption of a star, and suggest a preliminary redshift of 0. 14, 
in which case the source is not associated with the cluster at 
redshift 1.1. 

Finally, some of the oldest surviving HCSSs would consist 
mostly of stellar end states. They would be quite faint, since 
only very low-mass stars and WDs would remain, but could 
possibly be identified by their very unusual colors. 

8. THE INVERSE PROBLEM 

We have focused on the "forward" problem of predicting 
the numbers and properties of HCSSs given reasonable as- 
sumptions about the distribution of gravitational wave kicks 
and the merger history of the universe. Once HCSSs have 
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been detected, one can begin work on the potentially more in- 
teresting inverse problem: using the measured properties of 
HCSSs to infer the distribution of GW kicks and its evolution 
over time. 

The inverse problem is made easier by the remarkable prop- 
erty of HCSSs (§2) that they encode the magnitude of their 
natal kick in their spectra. Measuring the degree to which 
the absorption-line spectrum of a HCSS has been broadened 
by internal stellar motions leads immediately to an estimate 
of V]^. Such a measurement is completely independent of the 
space velocity of the HCSS at the moment of observation. 
It is reasonably independent of the initial (pre-kick) density 
profile, and it depends on the the time since the kick only to 
the extent that the HCSS changes its structure over time; such 
changes are expected to be small for the brightest HCSSs(§4). 

For a "coUisional" (Mbh ^ IO^Mq) HCSS, with 1.75, 
equation ( 1211 1 gives 

lnF3 ==-2.17 + 0.56 X 1.75, (53) 

i.e. 

Vk « 3.3aobs- (54) 

Absent any knowledge about the internal structure of the 
HCSS, the coefficient in equation (|54] | is uncertain, but not 
greatly so. Allowing the inner density profile slope to vary 
over the range 1 < y < 2 implies 

2.9^yk/aobs^5.0. (55) 

If more information about the HCSS is available than just 
Oobs^ this estimate of could be refined, to a degree that de- 
pends on the size and distance of the HCSS and on the access 
to observing time: (1) A deep spectrum would allow extrac- 
tion of the stellar broadening function N{V), as in Figure [T] 
N{V) contains more information about the spatial and veloc- 
ity distr ibution of stars around the SMBH than Oobs alone (e.g. 
iMerritt 1993l) . (2) If the HCSS is near enough and/or large 
enough to be spatially resolved, constraints can be put on the 
slope of the stellar density profile from the photometry. 

Measuring both f\ and gives Mbh (eq- ID. allowing 
one to investigate the dependence of kick velocity on SMBH 
mass, and (via the Mbh ^ O relation) on galaxy mass. Com- 
bined with the total light of the HCSS and perhaps with a 
mass-to-light ratio derived from broad-band colors, rjj and 
Mbh give an estimate of the pre-kick nuclear density via equa- 
tion ©. 

Most detected HCSSs may be spatially unresolved. Even 
in this case, broad-band magnitudes would allow a sample 
of HCSSs to be placed on the color-magnitude or velocity 
dispersion-magnitude diagrams (Figs. [19] l20l i. The number 
of detected HCSSs per unit volume combined with their dis- 
tribution over these observational planes contains informa- 
tion about the time-integrated ejection rate, hence the galaxy 
merger rate. Colors would also provide an indirect constraint 
on the time since the kick. 

So far we have emphasized kicks lar ge enough to unbin d 
SMBHs from galaxies, Vt ^ 500 km s"' (iMerritt et al. 2004 . 
If these are the only objects detected as HCSSs then they 
will contain information only about the high-Vk part of the 
kick distribution (although we note that a large fraction of 
kicks may be above 500 km s^'). Many kicks will fall below 
galactic escape velocities, particularly in the largest galax- 
ies, producing HCSSs that oscilla te about the core or drift 
for lon g times in the en velope ( Madau and Ouataert 2004t 
iGualandris & Merritt 2008 ). Since the size of a HCSS scales 
inversely with its kick (eq. [U, such objects would be among 



the largest and brightest HCSSs, but detection might be diffi- 
cult since they would be superposed on or behind the image 
of the galaxy. Such HCSSs would also have finite lifetimes 
before finding their way back to the center of the galaxy. 

9. CONCLUSIONS 

1 . Supermassive black holes (SMBHs) kicked out from the 
centers of galaxies by gravitational wave recoil are accom- 
panied by a cluster of bound stars with mass ^ 10^^ times 
the black hole mass or less, and radius ^ lO' pc or less - a 
"hyper-compact stellar system" (HCSS). 

2. HCSSs have density profiles that can uniquely be cal- 
culated given the kick velocity and given the stellar distribu- 
tion prior to the kick. The density at large distances from the 
SMBH falls off as -r-"*. 

3. Internal (rms) velocities of HCSSs are very high, 
^ 10^ — 10^ km s^', and comparable to their kick veloci- 
ties. Their overall velocity distributions are extremely non- 
Gaussian. 

4. HCSSs could be distinguished photometrically from 
foreground red giants, based on their bluer (lower) values of 
B — V for a given B — K. They also should appear redder than 
low-metallicity GCs with comparable ages > 1 Gyr 

5. With a simplified cosmological merger model, we are 
able to estimate expected number counts and luminosity dis- 
tributions of HCSSs in the local universe. Detection of per- 
haps 10^ HCSSs should be possible in the Virgo cluster alone, 
although only a few may be bright enough to allow high S/N 
spectroscopy and provide solid confirmation of their extreme 
velocity dispersions. 

6. Some HCSSs may already exist in survey data of com- 
pact stellar sysetms in the Fornax, Virgo and Coma galaxy 
clusters. 

7. Because the kick velocity of a HCSS is related in a sim- 
ple way to its measured velocity dispersion, the distribution of 
gravitational wave kicks can be empirically determined from 
a sufficient ly large sample of HCSSs. 

Paper I (iKomossa & Merritt 2008a h first derived the basic 
properties of HCSSs including their compactness and high 
internal velocity dispersions, and presented a possible route 
to detection via off-nuclear tidal disruption flares. The cur- 
rent paper derives the intrinsic properties of HCSSs in a more 
complete and general way and relates those properties to the 
properties of the host galaxy. Together, these two papers com- 
plement the growing number of recent papers that discuss 
gas-related signatures of gravitational wave recoil. While this 
paper was being submitted, we learned of a related work by 
O'Leary & Loeb (2009) which argues that many thousands of 
low-mass HCSSs should be present in the Milky Way halo. 
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APPENDIX 

A. Computation of the Bound Mass 

Here we compute the mass in stars that remains bound to a SMBH after the latter receives an instantaneous kick of magnitude 

We assume that the pre-kick phase space density of stars is 

/o(£) =C|2£|^"^/^ -oo<2£<0 (1) 

where E = — GM^n/r is the energy per unit mass of a star The pre-kick stellar density is po('") °= r^~^- As above, we ignore 
the contribution to the gravitational potential from the stars. 

Immediately after the kick, transfer to a frame moving with the SMBH. Assume without loss of generality that the kick is along 
the X axis. In this frame, the phase space density is 



f = C 



2GMbh ^2 „2 T,2 



■-(Vx-Vkr-v;-v, 



7-3/2 



in the velocity-space region that lies within the spere 



and zero elsewhere. 
Stars are bound to the SMBH after the kick if they lie within the sphere 



(2) 



(3) 



(4) 
r 

The intersection of the surface of this sphere, and the surface of the sphere defined by equation (O, defines a circle of radius 

^^ 1/2 

2GMBH/r - (Vk/2)^ with center on the V^-axis at = yk/2 = Vo (Figure |2TJi)- 
The configuration-space density of the bound stars is then 

2GMbh 



Pk(r) = 27tC j dV, I dV, V, 



(V.-Vk) -V/ 



Y-3/2 



(5) 



where the velocity-space volume element has been written d^V = IndV^VtdVt, V? = + V?, and the region of integration 
extends over the volume enclosed by the intersection of the two spheres in Figure l2lk. 
Define new variables 



K-Vk 



^2GMBH/r' 2GMBH/r' 
The contribution to Pi^{r) from the velocity-space region to the left of the dotted curve in Figure l2Th (i.e. Vx < Vq) is 



P/(r)=7lC 



2GMbh\' 



r ) J-i 
%C /2GMbhV 



dx 



J 2 nT-3/2 

dy[\~x -y) 



2\'^-l/2 



(6) 



(7a) 



(7b) 



where = GM^ii/V^ as above; Pi — for r > 8rk. The contribution to Pk('") from the velocity-space region to the right of the 



dotted curve in Figure l2Tk (i.e. V, > Vo) is 

V / J-\/r/Si\ Jo 

TtC /2GMbh\^ fi-V^ 



—X —r/2r^ — IX'J r/lr-^ 



dy[\-x -y) 



y-1/2 V r ) }-J7J%7^ 



(y-l/2)/2 




7-1/2- 



(8a) 



(8b) 



Summing p/(r) and p//(r), and fixing C by the requirement that the post-kick density in the limit Vk ^ equal the pre-kick 
density po('"): 



yields 



Pk('-) - 
/i(Y,'-) = 



3-Y Mbh / r 



2% rl \ r. 



271/1 (y,0) rl \r, 
1 



-y 



(7-l/2)/2 



/2(Y,'-) 



r<8rk 



/r/2rk-l 



dx (i — x^y ^ , 



dx 



7-1/2 



(9) 

(10a) 
(10b) 
(10c) 
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Fig. 21. — (a) Illustrating the velocity-space region occupied by stars at radius r around a kicked SMBH, in a frame moving with the kick velocity. Stars that 
remain bound after the kick lie in the region of intersection of the two spheres. Solid lines show the density of bound stars around a kicked SMBH, immediately 
after the kick, for four different values of the pre-kick density slope, y = (0.5, 1, 1.5,2) (dotted lines), (b) Solid lines show the density of bound stars around a 
kicked SMBH, immediately after the kick, for four different values of the pre-kick density slope, y= (0.5, 1, 1.5,2) (dotted lines). 

This density is plotted in Figure l2Tb for four values of y. The integrals Ii,l 2 can be expressed in terms of hypergeometric 
functions; the quantity /i (y, 0) that appears in the denominator of equation (llOal i is 



h (y,o) = 



VSr(Y+i/2) 



(11) 



r(Y+i) 

We stress that the density expressed by equations ( fTOl i does not represent a steady-state distribution. After phase-mixing, the 
density of stars will be non-zero at all radii < r < 00 and the bound cloud will be elongated along the x-axis, as discussed 
above and as shown in Figure|2] However the fact that the density of bound stars is spherically symmetric in configuration space 
immediately after the kick allows the mass of the bound population to be straightforwardly computed: 

Mb 471 / (^rr^Pk(r). 



Writing 



as above, we have finally 



2Mb=Fi(Y)Mk, Mk = 2MBH 



3-Y 



2 (3-Y) 
/i(Y,0) 7o 



dz z^-^ \h ij,zr^) - (2z)(^-'/2)/'/2(Y,zrk) 



(12) 



(13) 



(14) 



This function is plotted as the solid line in the top panel of Figure [T] where it is compared with the approximate expression 
11.6y"'-^^ 

appendix 

B. Glossary of acronyms and variables 

BH: Black Hole 

DGTO: Dwarf-Globular Transition Object 

GC: Globular Cluster 

GH: Gauss-Hermite (expansion) 

GW: Gravitational Wave 

IMF: Initial Mass Function 

HCSS: HyperCompact Stellar System 

SMBH: SuperMassive Black Hole 

S/N: Signal-to-Noise ratio 

UCD: UltraCompact Dwarf (galaxy) 



Y: power-law index for pre-kick density scaling with radius 
A: Coulomb logarithm for 2-body relaxation 
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t,: dimensionless radius in Dehnen profile; eqn. ( fT4b 
p(r): stellar density profile around BH 
po: fiducial stellar density at ro 

a: 1 -D velocity dispersion of the pre-kick galactic bulge 
Go- width of Gaussian term in GH expansion 
Cc'- velocity dispersion for GH expansion 

aobs^ observed velocity dispersion of the post-kick HCSS; eqn. (|20] | 
T,{R): projected stellar surface density 
(|)(M): initial mass function; eqn. ( l48T l 

4>(M,z): distribution function of merging BHs with individual mass M at redshift z; eqn. ( |49] l 
Q./^: cosmological density parameter for dark energy 
Q.ra' cosmological density parameter for matter 
a: semi-major axis of pre-merger BH binary orbit 

Osq. semi-major axis of BH binary orbit at point when dominated by GW losses 
a^: semi-major axis of pre-merger bound BH binary orbit; eqn. ( |24] | 
fli : spin parameter of larger pre-merger BH 
02'. spin parameter of smaller pre-merger BH 

d: distance of passage from galactic center for collisionless loss-cone 

dsym- distance HCSS travels after kick before symmeterizing into an elongated spheriod; eqn. (fTsT i 

Fi (y): dimensionless scaling function relating Mt and M\,; eqns. (|2l|8]l 

^2(7): dimensionless scaling function relating rgff and r^; eqns. (fTTlfTsT l 

Fj{'y): dimensionless scaling function relating Oobs and V^; eqns. (ISOllSTT l 

fi,: fraction of bound stellar mass Mb relative to BH mass Mbh; eqn. ^ 

f{M): distribution function of merging BHs as a function of mass; eqn. (|49] l 

Gmiy)' dimensionless scaling function relating Mb, Mbh, and r^ff in collisionless regime; eqns. ( I4ni42] i 

Gs(y): dimensionless scaling function relating Mb, O, and reff in collisionless regime; eqns. (HTlHSl i 

g{z)'. distribution function of merging BHs as a function of redshift; eqn. ( l49l l 

h: dimensionless Hubble expansion parameter 

h4: measure of deviation from Gaussian in GH expansion 

Hmiy)' dimensionless scaling function relating Mb, Mbh, and Oobs in collisionless regime; eqns. ( I43ll44l l 

//s(y): dimensionless scaling function relating Mb, O, and Oobs in collisionless regime; eqns. (I43ll44] l 

K{'y): dimensionless scaling function relating Mb and Mbh; eqn. (I19b| i 

L: bolometric luminosity of HCSS 

Ml : mass of larger pre-merger BH 

M2'. mass of smaller pre-merger BH 

Mb: post-kick mass in bound stars; eqn. ^ 

Mbh: mass of the final, post-kick BH 

Mcore: total mass in stars ejected from galactic core; eqn. ( |29c| i 

Md: total stellar mass in Dehnen (post-kick) density profile; eqn. (flST l 

Mfc! pre-kick mass in stars within f\; eqn. ^ 

N: multiplier of o which gives escape velocity from core-Sersic galaxy; eqn. (l36l l 

A^: post-kick rate of tidal disruptions; eqn. (|45] | 

N{V): distribution function of line-of-site velocities 

q: mass ratio of binary BH M2/M1 

r. radial distance from center of stellar cluster 

R: projected radial distance from center of stellar cluster 

R\,: projected break radius for core-Sersic profile 

R{M,z)'- merger rate of binary BHs with total mass M at redshift z; eqn. (l50l l 

r,: pre-kick radius containing 2Mbh in stars 

tq: fiducial radius used to normalize pre-kick density profile 

tq: scaling radius for Dehnen density profile; eqn. ( fT4l l 

Teff: effective projected radius of post-kick stellar density profile; eqn. (fTTI i 

rinfl: influence radius; eqn. (O 

f\: kick radius; eqn. ([U 

rt: tidal disruption radius; eqn. (|45] | 

f^: time elapsed since kick 

fR! relaxation time of pre-merger stellar nucleus 

fsym- time elapsed after kick before HCSS symmeterizes into an elongated spheriod; eqn. (fT2l i 
Vesc: escape velocity from host galaxy 
Vk: initial kick velocity of merged BH 
Vk: 3-vector representation of kick velocity 
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